JEuropaisches Patentamt 
European Patent Office 
Office europeen des brevets 



EUROPEAN PATENT APPLICATION 



21) Application number : 95300953.7 

22) Date of filing : 15.02.95 



® int. ci. 6 : H04N 7/24, H04N 7/15 



(30) Priority: 25.02.94 US 201871 

(43) Date of publication of application : 
30.08.95 Bulletin 95/35 

@ Designated Contracting States : 
DE FR GB IT 

@ Applicant : AT & T Corp. 
32 Avenue of the Americas 
New York, NY 10013-2412 (US) 

© Inventor : Cash, Glenn Lawrence 
49 Little Street 

Matawan, New Jersey 07747 (US) 



Inventor : Civanlar, Mehmet Raha 
85 Coleman Avenue 
Red Bank, New Jersey 07701 (US) 
Inventor : Gaglianello, Robert D. 
197 White Road 

Little Silver, New Jersey 07739 (US) 
Inventor : Swicker, Donald Bruce 
1701 Maxwell Drive 
Wall, New Jersey 07719 (US) 

@ Representative : Buckley, Christopher Simon 
Thirsk et al 
AT&T (UK) LTD., 

AT&T Intellectual Property Division, 

5 Mornington Road 

Woodford Green, Essex IG8 0TU (GB) 



(S) Multipoint digital video communication system. 

(57) The present invention provides a novel video 
communication system incorporating multiple 
window display. According to the present inven- 
tion, one or more video transmission nodes 
provide macroblocks of video data available in a 
plurality of resolution levels. The transmission 
node encodes the macroblocks and assigns 
macroblock identifiers (IDs) to each macrob- 
lock. A receiving node receives a plurality of 
video sequences from a plurality of transmis- 
sion nodes through a communication network. 
The receiving node operating according to the 
present invention thereafter eliminates macrob- 
locks of video data that will not be displayed, 
such as in the case of portions of video that are 
hidden or overlapped. A macroblock translator 
also transforms the macroblock ID to a new 
macroblock ID which reflects the macroblock's 
position on the display screen as dictated by the 
user's window configuration. A decoder there- 
after decompresses the video data one macrob- 
lock at a time, and provides the decompressed 
macroblock to a frame buffer. The frame buffer 
provides the window configured video data to a 
display. 
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Field of the Invention 

The present invention relates to the field of digital 
video processing and transmission, and particularly 
to multiple window digital video reception and display. 

Background of the Invention 

The acceptance of digital video compression 
standards f for example, the Motion Picture Expert 
Group (MPEG) standard, combined with the availabil- 
ity of a high-bandwidth communication infrastructure 
have poised the telecommunications market for an 
explosion of video based services. Services such as 
video-on-demand, multi-party interactive video 
games, and video teleconferencing are actively being 
developed. These and otherfuture video services will 
require a cost-effective video composition and dis- 
play technique. 

An efficient multiple window display is desirable 
for displaying the multiple video sequences produced 
by these applications to a video user or consumer. 
The implementation of such a windows environment 
would permit a user to view simultaneously several 
video sequences or images from several sources. 
The realization of a commercial multiple window vid- 
eo display is hampered by technological limitations on 
available data compression equipment. 

In digital television and other digital image trans- 
mission applications, image signals must be com- 
pressed or coded to reduce the amount of bandwidth 
required for transmission. Typically, a full screen 
frame of video may be composed of an array of at 
least 640 x 480 picture elements, or pixels, each pixel 
having data for luminance and chrominance. Under 
one standard, for example, the frames are composed 
of 720 x 480 pixel arrays. A video sequence is com- 
posed of a series of such discrete video frames, sim- 
ilar to the frames in a moving picture film. True enter- 
tainment quality video requires a frame rate of at least 
thirty frames per second. Uncompressed, the bit rate 
required to transmit thirty frames per second would 
require far more bandwidth than presently practical. 

Image coding techniques serve to compress the 
video data in order to reduce the number of bits trans- 
mitted per frame. There are several standard image 
coding techniques, each of which take advantage of 
pixel image data repetition, also called spatial corre- 
lation. 

Spatial correlation occurs when several adjacent 
pixels have the same or similar luminance (bright- 
ness) and chrominance (color) values. Consider, for 
example, a frame of video containing the image of a 
blue sky. The many pixels comprising the blue sky im- 
age will likely have identical or near identical image 
data. Data compression techniques can exploit such 
repetition by, for example, transmitting the luminance 
and chrominance data for one pixel and transmitting 



information on the number of following pixels for 
which the data is identical, or transmitting only the 
difference between adjacent pixels. Presently, spa- 
tial correlation is exploited by compression techni- 

5 ques using discrete cosine transform and quantiza- 
tion techniques. Where such data compression or 
coding is employed, each video source or transmis- 
sion node must be equipped with data encoding 
equipment and each receiving node must likewise be 

10 equipped with decoding equipment. Several video 
coding protocols are well-known in the art, including 
JPEG, MPEG1, MPEG2 and Px64 standards. 

In a multipoint video application, such as a video 
teleconference, a plurality of video sequences from a 

15 plurality of sources are displayed simultaneously on 
a video screen at a receiving node. Multiple window 
video display currently requires the use of multiple 
decoding devices. Otherwise, the video data arriving 
from multiple sources would often overload the ca- 

20 pacity of a single decoding device. Furthermore, cur- 
rently available decoding devices are not equipped to 
handle simultaneously video sequences from dispar- 
ate sources. Decoding circuitry relies on video se- 
quence context information which emanates from the 

25 source of the video. Current decoding devices cannot 
store, access and switch between several video se- 
quence contexts as would be necessary to contem- 
poraneously decode video from several sources. 
The disadvantage of the prior art, therefore, is the 

30 requirement of multiple decoding devices. At present, 
the decoder chips and chip sets, even relatively sim- 
ple ones, such as those compatible with the JPEG 
and Px64 technologies, are expensive. As a conse- 
quence, the use of multiple decoding devices pro- 

35 vides an impractical windowing solution. 

A further difficulty encountered in multiple win- 
dow video is that many sources provide video in only 
one screen display size. In fact, many sources trans- 
mit only full screen images which typically comprise 

40 640 x 480 pixels per frame. To provide truly flexible 
windowing capabilities, different users should have 
the option of invoking and viewing differently sized 
windows of the same video. Windows which comprise 
a fraction of the entire display require the image data 

45 to be filtered and subsampled, resulting in frame sig- 
nals comprising less pixels. For example, a 1/4 screen 
window, requires frame data comprising only 320 x 
240 pixels. It is therefore advantageous to make video 
data available at a plurality of window sizes or reso- 

50 lution levels. For example, the video of a participant 
in a teleconference may be made available at full 
screen resolution, 1/4 screen, 1/16 screen or 1/64 
screen, so that the other participants can choose a 
desired size window in which to view the transmitting 

55 participant. 

Under one technique of providing multiple resolu- 
tion levels, each video transmitter provides a plurality 
of video sequences, each independently containing 
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the data signal for a particular resolution level of the 
same video image. One method of generating multi- 
ple resolution video sequences would be to employ 
several encoders, one for each resolution level. The 
requirement of multiple encoders, however, as in the 
case of decoders, increases system cost. Encoders 
comprise extremely costly components in digital vid- 
eo transmission systems. 

Summary of Invention 

The present invention provides a method and 
system for encoding, transmitting, receiving, decod- 
ing and displaying a plurality of video sequences 
available at a plurality of resolution levels while min- 
imizing the requirements for encoding and decoding 
circuitry. 

According to the present invention, one or more 
video transmission nodes provide macro blocks of vid- 
eo data available in a plurality of resolution levels. 
The transmission node encodes the macroblocks and 
assigns macroblock identifiers (IDs) to each macro- 
block. A receiving node receives a plurality of video 
sequences from a plurality of transmission nodes 
through a communication network. The receiving 
node operating according to the present invention 
thereafter eliminates macroblocks of video data that 
will not be displayed, such as in the case of portions 
of video that are hidden or overlapped. A macroblock 
translator also transforms the macroblock ID to a new 
macroblock ID which reflects the macroblock's posi- 
tion on the display screen as dictated by the user's 
window configuration. A decoder thereafter decom- 
presses the video data one macroblock at a time, and 
provides the decompressed macroblock to a frame 
buffer. 

The macroblock elimination and macroblock 
translation steps may be executed either at the re- 
ceiving node or at a bridge connected to the commu- 
nication network. The present invention may provide 
the above described functionality in a variety of stan- 
dard video compression environments, including 
JPEG, MPEG1, MPEG2 and Px64. Alternative em- 
bodiments of the present invention support intra- 
frame, interframe, and motion compensated video 
compression. 

The above discussed features, as well as addi- 
tional features and advantages of the present inven- 
tion, will become more readily apparent by reference 
to the following detailed description and the accom- 
panying drawings. 

Brief Description of the Drawings 

FIG. 1 illustrates an example of a video telecon- 
ferencing system which may be utilized in con- 
junction with method of the present invention; 
FIG. 2 illustrates a multipoint encoder circuit op- 



erable to transmit compressed video data in four 
different resolution levels from a single input vid- 
eo sequence without introducing appreciable de- 
lay according to one aspect of the present inven- 

5 tion; 

FIG. 3 illustrates a functional block diagram of a 
multipoint decoder circuit which is capable of re- 
ceiving, decoding and displaying video data from 
a plurality of sources simultaneously using the 

w method of the present invention; 

FIG. 4 illustrates a detailed block diagram of a 
macroblock translator and eliminator, a decoder 
and related circuitry which are capable of practic- 
ing one aspect of the present invention in order to 

15 enable multiple window display of a plurality of 

video images corresponding to a plurality of com- 
pressed data streams; 

FIG. 5 illustrates an example of a network em- 
ploying a bridge which may be used in conjuno 
20 tion with at least one encoder and at least one de- 
coder according to the present invention to pro- 
vide a high efficiency multiple user video telecon- 
ferencing system; and 

FIG. 6 illustrates an alternative embodiment of a 
25 bridge which may be used in conjunction with a 

system employing at least one encoder and at 
least one decoder according to the present inven- 
tion to provide a high efficiency multiple user vid- 
eo teleconferencing system. 

30 

Detailed Description of the Drawings 

FIG. 1 illustrates an example of a multipoint video 
system suitable for use in conjunction with the meth- 

35 ods of the present invention. While the exemplary 
system in FIG. 1 shows a three participant video tel- 
econferencing system, any number of participants 
that will reasonably fit on a video display screen are 
possible using the present invention. Likewise, other 

40 sources of digital video may replace one or more of 
the participants, such as a digital chalkboard or video 
laser disk. 

Three nodes 12, 22 and 32 are connected 
through communication lines 40, 50 and 60, respec- 

45 tively , to a network 70. The node 1 2 comprises a work 
station including a video display 10 and a video cam- 
era 14. A mouse 15 or similar input device is connect- 
ed to the video display 10. Display windows 17 1( 17 2 
and 1 7 3 appear on the video display 1 0. Nodes 22 and 

50 32 also comprise work stations and include similar 
video displays 20 and 30 and similar video cameras 
24and 34, respectively. Display windows 27 1f 27 2 and 
27 3 appear on video display 20 and the display win- 
dows 37 1t 37 2 and 37 3 appear on video display 30. 

55 Each of the work stations 1 2, 22 and 32 operates 

as both a video transmission node and a video receiv- 
ing node, in other words, they both provide video to 
and receive video from the other nodes. The video 
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cameras 14, 24 and 34 provide video signals for 
transmission and the video displays 10, 20 and 30 re- 
ceive and display video signals. To this end, each 
work station 12, 22 and 32 includes a multipoint en- 
coder circuit, not shown, which enables transmission 5 
of digital video data to a plurality of nodes, and a mul- 
tipoint decoder circuit, not shown, which enables si- 
multaneous reception and display of digital video sig- 
nals from a plurality of nodes. According to the pres- 
ent invention, one or more of the work stations 12, 22 10 
and 32 may be equipped with a multipoint encoder cir- 
cuit similar to the one described in connection with 
FIG. 2 below. Furthermore, at least one of the work 
stations may be equipped with a multipoint decoder 
circuit similar to the one described in connection with 15 
FIGS. 3 and 4, below. 

In the example illustrated in FIG. 1, user A, user 
B and user C establish and participate in a video tel- 
econference. Users A, B and C are located at work 
stations 12, 22 and 32 respectively. In operation, work 20 
stations 12, 22 and 32 establish intercommunication 
by connecting through communication lines 40, 50 
and 60 to the broadband network 70. The network 70 
may comprise any suitable data communications in- 
frastructure, such as a local area network, wide area 25 
network, PBX or digital cable TV system. 

Each user then transmits a digital video signal 
comprising a plurality of video sequences, each con- 
taining a different resolution level of the image cap- 
tured by cameras 1 4, 24 and 34 to the network 70. For 30 
example, user A provides a camera i mage of his upper 
torso and face at full screen (640 x 480 pixels), 1/4 
screen (320 x 240 pixels), 1/16 screen (160 x 120 pix- 
els) and 1/64 (80 x 60 pixels) screen resolution. Be- 
cause of the manner in which video data is segment- 35 
ed, which will be discussed further below, the 1/64 
screen resolution may actually comprise either 80 x 
56 or 80 x 64 pixels. User B and user C may provide 
like video sequence signals of their video images. 

Users A, B and C then select a display preference 40 
by choosing the size and placement of windows in 
which the video from each participant, A, B and C will 
be viewed. For example, user A may, for example, se- 
lect the window illustrated by window 1 1\ to view his 
own video signal, window 17 2 to view user B*s video 45 
signal, and window 17 3 to view user C's video signal. 
User B and user C likewise select windows indepen- 
dent of user A and each other. As a consequence, 
while user A selects to a 1/16 screen image of each 
participant, user B selects to view 1/4 screen of her- 50 
self, and a 1/16 screen image of both user A and user 
C. User C selects yet a different configuration. 

Once the selections are made, each work station 
12, 22 and 32 establishes a virtual circuit connection 
to receive the selected video signals of each partici- 55 
pant over the network 70. For example, user A estab- 
lishes a connection to receive the 1/16 resolution vid- 
eo data transmission of both B and C, while user B es- 



tablishes a connection with the 1/16 video sequence 
of user A and the 1/4 video sequence of user B. In the 
example discussed above, the full screen resolution 
data sequence from all the users is not requested by 
any participants and therefore is not transmitted over 
the network. This reduces the number of transmitted 
bits, and consequently reduces the load on the net- 
work. 

As stated above, each of the nodes 12, 22 and 32 
provides a digital signal comprising a plurality of re- 
solution level video sequences to the network 70. A 
suitable multipoint encoder circuit which is operable 
to produce the plurality of video sequence digital sig- 
nals from an analog video input is illustrated below in 
connection with FIG. 2. 

A multipoint encoder circuit 200 capable of pro- 
ducing up to four sequences of video data represent- 
ing four different resolutions of video receives a Na- 
tional Television Systems Committee (NTSC) analog 
video signal on input line 202. The source of the signal 
may suitably be one of the video cameras 14, 24 or 
34 from FIG. 1 , or other analog television raster signal 
source. The multipoint encoder circuit 200 operates in 
accordance with the MPEG2 video compression stan- 
dard, but may readily be adapted to any other suitable 
standard by one of ordinary skill in the art. 

The NTSC input line 202 is connected to an A/D 
converter 204. The A/D converter 204 may suitably be 
an analog to digital signal conversion device capable 
of digitizing a 525 line NTSC analog signal into a dig- 
ital signal of 640 x 480 pixels. Devices with such ca- 
pabilities are readily available. The A/D converter 204 
is connected to a first anti-aliasing filter 208 and a 
third strip memory 220. 

The first anti-aliasing filter 208 is coupled to both 
a second anti-aliasing filter 21 2 and a first strip mem- 
ory 216. The second anti-aliasing filter 212 is further 
coupled to a second strip memory 218. Each of the 
first and second anti-aliasing filters 208 and 212 com- 
prises a low-pass digital filter. 

A controller 224 is operably connected to write 
control pins or connections of each of the first, sec- 
ond and third strip memories 216, 218 and 220. The 
controller is programmed to effect subsampiing by se- 
lectively triggering the write control on each memory 
216, 21 8 and 220. The selective triggering allows only 
certain intervals of data to be written therein. The con- 
troller 224 may suitably comprise a microprocessor or 
sequence controller, the programming of which would 
be apparent to one of ordinary skill in the art. The con- 
troller 224 is further operably connected to a control 
connection of a multiplexer 228. The multiplexer 228 
may comprise any suitable time division multiplexing 
device. 

The strip memories 216, 218 and 220 are oper- 
ably coupled to the multiplexer 228. The multiplexer 
228 is thereafter connected to an encoding device 
240. The encoding device 240 may suitably comprise 
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a video data encoding chip or chip set which encodes 
data according to the MPEG2 standard. The encoding 
device 240 will be further coupled to an reference 
frame store 232 to enable interframe coding. The 
controller 224 is also connected to the reference 5 
frame store 232. Encoding devices using only intra- 
f rame coding, such as JPEG, need not include the ref- 
erence frame store 232. 

The encoding device 240 is coupled to a buffer 
244. The buffer 244 output connection is coupled to 10 
a packetizer 248. The buffer 244 and packetizer 248 
are both connected to the controller 224. 

The encoder circuit 200 described above oper- 
ates in the following manner to provide multiple video 
sequences, each providing a distinct resolution of the 1 5 
same video image. 

A source of video, not shown, provides NTSC vid- 
eo signal to the A/D converter 204. The A/D converter 
204 samples the signal at 640 samples per line. While 
NTSC signals comprise a 525 line video signal, only 20 
approximately 480 lines contain relevant video infor- 
mation. As a result, the A/D converter 204 may create 
a 640 x 480 digital representation of the video signal. 
Each sample comprises video data for a picture ele- 
ment or pixel. Each pixel may be considered as hav- 25 
ing an address of (x,y) where x = 0 through 639 and 
y = 0 through 479. 

It should be noted that NTSC signals do not pro- 
vide the 480 visible lines sequentially. Instead, the 
signal is divided into two fields, each containing the 30 
information for every other line. In other words, one 
field contains all the even numbered lines and the 
other field contains all the odd numbered lines. One 
field is transmitted in its entirety before the other field 
is transmitted. Because the A/D converter 204 digitiz- 35 
es the NTSC signal as it is received, the resulting dig- 
ital signal is likewise divided into two fields. 

The A/D converter 204 provides the full 640 x 480 
resolution video sequence bit stream to both the first 
anti-aliasing filter 208 and the third strip memory 220. 40 
The filter 208 functions as a digital low pass filter and 
removes high frequency components that could 
otherwise corrupt a 2 to 1 subsampling operation. 
The filtered digital signal is then provided to the first 
strip memory 216. The controller 224, however, by 45 
selectively providing write enable signals, only allows 
every other pixel to be written to the strip memory 
216. The strip memory 216 will therefore only hold 
320 x 480 pixels of video data. Moreover, because the 
data is provided in two fields, the strip memory will so 
contain, in sequential order, two fields of 320 x 240 
pixels of video data, one with the odd rows and one 
with the even rows. Either one of these fields alone 
comprises a 1/4 screen resolution frame of a video 
sequence. As a consequence, the field structure of 55 
the frame signal inherently subsamples the frame 
vertically at a 2 to 1 rate. 

The A/D converter 204 also provides the full 640 



x 480 frame signal to the third strip memory 220 to al- 
low for a full screen resolution transmission. The two 
fields of video data must of course be interleaved pri- 
or to display to build an entire frame. This task, how- 
ever, is accomplished at each receiving node, such as 
work station 12, 22 or 32 of FIG 1. 

Returning to the first anti-aliasing filter 208, the 
filter 208 also provides the filtered digital signal to an- 
other anti-aliasing filter 212. The second filter 212 
further filters the signal already filtered by the first fil- 
ter 208 to allow for higher order subsampling, namely 
4 to 1 and 8 to 1. 

The second anti-aliasing filter 212 provides the 
doubly filtered digital signal to the second strip mem- 
ory 218. The controller 224 provides write enable sig- 
nals to the second strip memory 218 to effect sub- 
sampling of the filtered signal. The controller 224 ef- 
fects 4 to 1 subsampling of the first field and 8 to 1 
subsampling of the second field. The controller 224 
accomplishes the subsampling by allowing every 
fourth pixel to be written for the first field of each 
frame and every eighth pixel to be written for the sec- 
ond field of each frame. 

The controller 224 additionally achieves vertical 
subsampling by eliminating every other line in the 
first field and 3 out of every 4 lines for the second 
field. As a result, the second strip memory 218 will 
contain a 160 x 120 pixel representation of the image 
(the first field) and 80 x 60 pixel representation of the 
image (the second field). Other suitable mechanisms 
for vertical and horizontal subsampling may be imple- 
mented and will be apparent to those of ordinary skill 
in the art 

The first, second and third strip memories 216, 
218 and 220, therefore, contain full screen, 1/4 
screen, 1/16 screen and 1/64 screen digital frame 
data. The memories 216, 218 and 220 provide data in 
the form of blocks called macroblocks or slices to the 
multiplexer 228. Macroblocks are blocks of pixels, 
typically a16x8 or 16x16 pixel array, which together 
constitute an entire frame of video data. One or more 
macroblocks comprise a slice, which are used in 
MPEG-based systems. Slices and macroblocks are 
processed in the same manner. For purposes of sim- 
plicity, generic term macroblock will be employed to 
describe either macroblocks or slices. 

In typical video signal applications, macroblocks 
are transmitted sequentially from left to right, top to 
bottom. Thus, for example, 2400 16x8 macroblocks 
would constitute an entire 640 x 480 pixel frame of a 
video sequence. 

The macroblocks at the multiplexer 228 consti- 
tute four independent video sequences, each con- 
taining the original video sequence at a different re- 
solution level. The multiplexer 228, under the control 
of the controller 224, provides a single stream of time- 
division multiplexed macroblocks of all four resolu- 
tions to the encoding device 240. The encoding de- 
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vice encodes or compresses each macroblock indi- 
vidually. Because MPEG-based compression sys- 
tems employ interf rame coding, the encoder 240 will 
receive reference frame information from the refer- 
ence frame store 232. In interf rame coding applica- 5 
tions, the encoder relies on information from previous 
(and in some cases future) frames to encode a par- 
ticular frame. The use of the reference frame store 
232 to provide the previous (or future) frame data to 
the encoder is well-known in the art. According to the 10 
present invention, however, the reference frame store 
232 must provide reference frame information for a 
plurality of video sequences, in other words, each re- 
solution. 

The reference frame store 232 must therefore 15 
contain reference frame information for ail four reso- 
lution level video sequences. The controller 224 di- 
rects the encoding device 240 to the proper reference 
frame by bank-switching the reference frame store 
232. For example, if the current macroblock entering 20 
the encoding device is from the 1/4 resolution video 
sequence, the controller directs the encoding device 
240 to the 1/4 resolution reference frame stored with- 
in the reference frame store 232. If the next macro- 
block is from the 1/64 resolution video sequence, the 25 
controller appropriately redirects the encoding device 
240. 

The encoding device 240 as described encodes 
or compresses data for all four resolution video se- 
quences when configured as shown in FIG. 2 and de- 30 
scribed herein. The encoder must, therefore, for ev- 
ery video frame received at input 202, encode data 
corresponding to one full screen frame plus data for 
1/4 frame, 1/16 frame and 1/64 frame. It should be 
noted, however, that the amount of data encoded per 35 
frame is not extensively increased, and thus the does 
not introduce appreciable delay or unduly tax the ca- 
pabilities of currently available encoding devices. 

The encoding device 240 further adds macro- 
block identifiers (IDs) to each macroblock. The mac- 40 
roblock ID comprises data representative of the mac- 
roblock's position in a frame of its video sequence. 
Macroblock IDs are typically assigned sequentially 
left to right, top to bottom in each frame. The macro- 
block IDs are specific to each video sequence. There- 45 
fore, the full screen resolution video sequence will 
have macroblock IDs numbering from 0 to 2399 while 
the 1/4 screen resolution sequence will have its own 
macroblock IDs numbering from 0 to 599 and so on. 

The encoding device 240 provides the time- so 
division multiplexed, compressed macroblocks, now 
comprising video data and their macroblock IDs, to a 
buffer 244. The controller 224 then controllably di- 
rects the buffer 244 to write macroblocks to the 
packetizer 248. 55 

It should be noted that in some compression stan- 
dards, such as the JPEG standard, macroblock IDs 
are not utilized. In such a circumstance, the controller 



224 may simply create and write the macroblock IDs 
to the compressed macroblocks at the packetizer. 
Such an adaptation would be readily implemented by 
one of ordinary skill in the art 

While the above encoder circuit 200 may be em- 
ployed to transmit video signals from a node such as 
work station 12 of FIG. 1, a multipoint decoder circuit 
is required to receive digital video signals from sev- 
eral sources. Multipoint decoder circuit 400 described 
below and illustrated in FIG. 3 may be incorporated by 
work station 12 of FIG. 1 to enable reception of mul- 
tiple digital video sequences. 

Referring now to FIG. 3, the multipoint decoding 
circuit 400 illustrated therein employs a novel multi- 
point decoding method in accordance with one aspect 
of the present invention. The circuit 400 as described 
below is compatible with systems using interframe 
encoded MPEG2 data with the motion compensation 
functionality suppressed. Other suitable interframe 
compression techniques may similarly be employed. 

The circuit 400 illustrated in FIG. 3 receives, de- 
codes and displays a plurality of video sequences 
provided through a network 402 as data packet 
stream. A depacketizer 406 receives from the net- 
work 402 packet data for a plural ity of video sequenc- 
es containing, for example, the three video images of 
users A, B and C from FIG. 1. The depacketizer 406 
provides three compressed video data streams cor- 
responding to the three video sequences to a macro- 
block translation and elimination (T/E) circuit 410. 
The macroblock T/E circuit 410 both eliminates mac- 
roblocks of video data that will not be displayed and 
translates the position of the macroblocks data to 
match the user's configuration. Macroblock elimina- 
tion is required because in a windows environment, 
certain portions of video window may overlap^por- 
tions of another window. As a result, if a given mac- 
roblock is contained within the portion of a window 
that is hidden from view or occluded, the macroblock 
T/E circuit 410 eliminates that macroblock data. 

The macroblock T/E circuit 410 thereafter trans- 
lates each macroblock's positional information to 
match the user-defined windows, such as windows 
17 1( 17 2 and 17 3 . To provide a versatile, user- 
definable windows environment the macroblock pos- 
ition must be converted to conform to the receiving 
user's window configuration. The T/E circuit 410 
serves to direct the macroblocks received to the prop- 
er portion of the screen. When a video sequence is 
transmitted by an encoder such as the one illustrated 
in FIG. 2, each macroblock contains a macroblock ID 
defining its position on the screen relative to other 
macroblocks in the video sequence. Macroblock IDs 
are ordinarily employed by receiving nodes to place 
the macroblock within the display screen. The macro- 
block ID is therefore translated to a new macroblock 
ID reflecting its user-defined display screen position. 

If the system receives macroblock video data 
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without macroblock IDs, the packetizer 406 in con- 
junction with the control circuitry may add the video 
sequence or source macroblock ID by reference to 
the macroblock's incoming sequence, as long as the 
macroblock IDs are received sequentially with re- 5 
spect to each video sequence. In other words, each 
macroblock from each source is simply assigned its 
sequence number with a frame of a video sequence 
as it is received by the packetizer 406. 

The T/E circuit 410 thereafter provides the par- 10 
tially occluded and translated video sequences, one 
macroblock at a time, to a decoding device 414. The 
macroblocks from each of three video sequences are 
intermixed randomly within the bit stream. The par- 
tially occluded bit stream includes only the data for 15 
the visible portion of the display. 

One important feature of the present invention is 
that the T/E circuit 410 eliminates invisible video data 
prior to decoding or decompression. By eliminating 
occluded macroblocks prior to decoding, the decoder 20 
414 performs the decoding function on the same or 
lesser quantity of data for several windows as it would 
for one full screen. According to the present invention, 
therefore, a single decoder may support any number 
of windows without overtaxing its processing capabil- 25 
ities. Further details regarding the T/E circuit 410, 
used in conjunction with the decoding device 414, are 
discussed below in connection with FIG. 4. 

The decoding device414 thereafter decompress- 
es or decodes the macroblocks suitable for storage 30 
and eventual display by the display controller 418, 
producing a stream comprising three video sequenc- 
es of macroblocks. Any suitable decoder, including 
JPEG, MPEG1, MPEG2 or Px64 type decoders may 
be configured according to the present invention to 35 
perform the functions of the decoder 414. 

The decoding device 414 must, however, be con- 
figured to be context switchable. In normal operation, 
a decoding device relies on certain overhead informa- 
tion generally provided in the transmission of each 40 
video sequence. As a result, the decoder must be con- 
figured to switch overhead parameters to correspond 
with the context of each video sequence being decod- 
ed. The decoder 41 4 thereby has the ability to receive 
and decode a plurality of incoming video sequences 45 
from unrelated sources. A detailed discussion of the 
T/E circuit 410 and the context-switchable decoder 
circuit 414 is provided below. 

FIG. 4 illustrates a circuit 500 comprising a de- 
tailed embodiment of the T/E circuit 41 0 and decoder 50 
circuit 414 discussed above in connection with FIG. 3. 
An input buffer 502 is connected to a RAM-based 
translation table 504. A control processor 506 is op- 
erably connected to the RAM-based translation table 
504. 55 

The RAM-based translation table 504 is in turn 
connected to a translated address buffer 509. The 
RAM-based translation table 504 is further connect- 



ed to a macroblock data buffer 5 1 0 for providing com- 
pressed video data without a macroblock ID address 
thereto. The buffers 509 and 510 may suitably com- 
prise first in first out (FIFO) buffers containing write 
control pins or connections. 

The RAM-based table 504 is further connected to 
a write control signal buffer 508. The write control sig- 
nal buffer 508 is further connected to the write control 
pins or connections of both the translated address 
buffer 509 and the macroblock data buffer 510. The 
translated address buffer 509 is coupled to a frame 
buffer 518. 

The macroblock data buffer 510 is coupled to a 
decoding device 512. The decoding device 512 is a 
chip or chip set operable to decode or decompress 
video data macroblocks according to the MPEG2 vid- 
eo compression standard. In other compression stan- 
dard environments such as JPEG, MPEG1 or Px64, 
other encoding devices may suitably be employed as 
appropriate. The decoder 512 is further coupled to 
both a context random access memory 514 and an 
reference frame storage device 516. The reference 
frame store 516 is configured to enable MPEG2 inter- 
frame coding in a manner well-known in the art. 

The decoder 512 is operably connected to a video 
frame buffer 518. The frame buffer 518 includes suf- 
ficient memory for at least one frame of video data. 
The frame buffer 518 is then connected to the display 
controller 418 of FIG. 3. 

In operation, the input buffer 502 receives a bit 
stream of compressed macroblocks for a plurality of 
video sequences along with their corresponding mac- 
roblock IDs from a de packetizer such as depacketizer 
506 from FIG. 4. The plurality of video sequences 
emanate from one or more video transmission nodes, 
such as work stations 12, 22 and 32 of the video tel- 
econference illustrated in FIG. 1 . The video transmis- 
sion nodes transmit a digital video sequence at any 
suitable frame rate of 30 frames per second or less. 
The input buffer 502 provides the compressed mac- 
roblocks and their IDs to the RAM-based translation 
table 504. 

The RAM-based translation table 504 converts 
the incoming macroblock ID to a new macroblock ID 
that correlates with the window configuration defined 
by the user. The RAM-based table 504 provides new 
macroblock block IDs associated with each macro- 
block's user configured position. The control proces- 
sor 506 provides updates to the RAM-based table 
based on changes in the user configuration. The con- 
trol processor 506 may suitably be any microproces- 
sor and related circuitry. The control processor 506 
may provide other timing and control functions inher- 
ent to the circuit in FIG. 5, the implementation of 
which would be readily apparent to one skilled in the 
art. 

The control processor 506 and translation table 
504 interact together as follows to effect macroblock 
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translation and elimination. The translation table 504 
comprises a memory device with a plurality of acces- 
sible addresses. The translation table 504 contains 
one memory address for each macroblock of each in- 
coming video. According to one embodiment of the 
present invention, the table's memory locations may 
suitably be indexed by the incoming macroblock's 
source or video sequence and macroblock ID. For ex- 
ample, the first macroblock of a frame from the video 
sequence provided by user A of FIG. 1 corresponds 
to one memory location while the first macroblock of 
a frame from the video sequence provided by user C 
of FIG. 1 corresponds to another location. 

When the receiving node user defines a plurality 
of windows into which one or more video sequences 
will be displayed, the control processor 506 calcu- 
lates a translated macroblock position corresponding 
to each macroblock received. The control processor 
506 then writes the new macroblock position data or 
macroblock ID to each incoming macroblock's table 
location. 

For example, consider the video teleconference 
of FIG. 1 wherein the three video sequences are dis- 
played in three windows on each of the displays 10, 
20 and 30. The macroblocks in each video sequence 
are defined at the transmission end to be displayed 
in a standard location, for example, extending to the 
right and downward from the upper left hand corner 
of the video screen. The upper left hand corner has 
a pixel address of (0,0). Because each video se- 
quence is so indexed, the upper left hand corner mac- 
roblocks from each of the three video sequences all 
have a macroblock ID of 0, which is located at pixel 
address (0,0). 

Each user, however, as discussed above in con- 
nection with FIG. 1, may define windows in different 
areas of his or her display. As a result, while a user's 
first window may start at pixel address (0,0), the sec- 
ond and third may start at (160,60) and (320,160), re- 
spectively, extending down and to the right. In gener- 
al, the macroblock beginning at (160,80) corresponds 
to macroblock ID 41 0 while the macroblock beginning 
at (320,160) corresponds to macroblock ID 820. 

The control processor 506 provides the transla- 
tion information, in other words, the new macroblock 
IDs to the RAM-based table 504. The translation table 
504 is indexed by each macroblock's video sequence 
and its macroblock ID as transmitted. According to 
the example described above, therefore, the control 
processor 506 would write macroblock ID 0 to the ta- 
ble position corresponding to the first video se- 
quence's macroblock 0, macroblock ID 410 to the ta- 
ble position corresponding to the second video se- 
quence's macroblock 0, and macroblock ID 820 to the 
table position corresponding to the third video se- 
quence's macroblock 0. 

The control processor 506 further provides the 
macroblock elimination information to the RAM-ba- 



sed translation table 504. In windowing applications, 
portions of windows may overlap, such as windows 
27 1t 27 2 and 27 3 in FIG. 1. The occluded video data 
need not be displayed or decoded. The control proc- 

5 essor 506 determines which macroblocks in which 
video sequences are occluded and provides this in- 
formation to the RAM-based translation table 504. In 
one implementation, the control device 506 may suit- 
ably provide a data flag to an occluded macroblock's 

w table address. The data flag signifies that the macro- 
block is to be eliminated. As a result, when an occlud- 
ed macroblock accesses its translation table ad- 
dress, the translation table 504 does not provide a 
macroblock ID but instead an eliminate macroblock 

is flag. 

The translation table 504 thereafter provides the 
compressed macroblock data from the plurality of vid- 
eo sequences to the macroblock data buffer 510. The 
results of the look-up in the RAM-based translation 

20 table 504, in other words, either the new macroblock 
ID or an eliminate macroblock flag, are provided to 
the write control signal buffer 508 and the translated 
address buffer 509. If the results of the look-up pro- 
duce a valid macroblock ID, then the write control sig- 

25 nal buffer 508 provides a write control signal to both 
the macroblock data buffer 510 and the translated ad- 
dress buffer 509. If the table address contains an 
eliminate macroblock flag, then no write signal is pro- 
vided. 

30 If no write signal is provided because the macro- 

block is occluded, the video data is not written to the 
macroblock data buffer 510 and contents of the look 
up table address are not written to the translated ad- 
dress buffer 509. Because neither the macroblock ID 

35 nor the macroblock video data are written, the mac- 
roblock is effectively eliminated. 

If, however, the write control signal buffer 508 
provides a write signal, the macroblock video data is 
written to the data buffer 510. The contents of the 

40 macroblock data buffer 5 1 0 are thereafter provided to 
the decoding device 512. Additionally, the contents of 
the translation table address, in other words, the mac- 
roblock ID, are written to the translated address buf- 
fer 509. The decoder 512, in conjunction with the con- 

45 text RAM 514 and the reference frame storage device 
516, decompresses or decodes the macroblock video 
data and provides the decoded macroblock data to 
the frame buffer 518. When the decoded macroblock 
video data is provided to the frame buffer 518, the 

so translated address buffer provides the new macro- 
block ID to the frame buffer 518. 

The context RAM 514 provides the decoding de- 
vice 512 with certain parameters external to the mac- 
roblocks concerning the macroblock's video se- 

55 quence. Under most standards, a certain amount of 
overhead information is associated with each video 
sequence. For example, the quantization factor used 
in encoding macroblocks in the particular video se- 
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quence is contained within the overhead information. 
Because the decoding device 512 receives macro- 
blocks originating at a plurality of sources, in other 
words, in a plurality of video sequences, the overhead 
information required by the decoding device 512 will 
change from macroblock to macroblock. The context 
RAM 514 provides the appropriate context informa- 
tion for the current macroblock being decoded by the 
decoding device 512. The video sequence context or 
overhead information is defined by the pertinent com- 
pression standard. For example, in the case of 
MPEG-based systems, see Generic Coding of Mov- 
ing Pictures and Associated Audio, Recommendation 
H.262, ISO/IEC 13818-2 (Committee Draft) for a de- 
scription of required video sequence context informa- 
tion. 

The frame buffer 516 stores the macroblock vid- 
eo data at an address corresponding to the new mac- 
roblock ID as provided by the translated address buf- 
fer 509. In this manner, the decoded video data is 
placed in a location within the frame buffer 516 that 
reflects the location of the window within the screen 
in which it will appear. The frame buffer 51 6 stores the 
video data as it is to appear on the video screen of the 
receiving node user. The frame buffer 516 provides 
the video data to the display controller 418 of FIG. 3. 

In an alternative embodiment of the above circuit 
500 modified to be suitable for use with intraframe 
JPEG coding, the reference frame buffer 516 is elim- 
inated. The JPEG video coding standard employs 
only intraframe coding. In intraframe coding, the data 
compression is achieved solely by exploiting spacial 
correlation within a single frame of a video sequence. 
As a result, the decoder 512 does not need to access 
prior (or future) frames of video and can therefore 
perform the decoding function without the reference 
frame buffer 51 6. Moreover, systems employing other 
video coding standards, such as MPEG standards, 
can force only intraframe coding, although at a loss 
of efficiency. Such systems could likewise avoid the 
need for the reference frame buffer 516. 

Another modified embodiment of the circuit 500 
described above is suitable for use with motion com- 
pensated video signals. Motion compensation is an- 
other form of video compression encoding wherein 
frame video data is obtained from displaced areas of 
previous or future frames in the sequence, which 
takes into account the motion of objects within the vid- 
eo image. In motion compensation compression, mo- 
tion vectors are transmitted which direct the decoder 
to the location within the reference frame from which 
the current macroblock has moved. The MPEG2 stan- 
dard, for example, supports such motion compensa- 
tion. If motion compensation is utilized, the reference 
frame buffer 516 would further include standard sup- 
port circuitry for motion compensation decoding. 
Such circuitry would be well known to one of ordinary 
skill in the art 



In addition, however, receiving node work sta- 
tions containing such an embodiment of the circuit 
500 must suppress occlusion and, hence, window 
overlap. Because motion compensation utilizes the 

5 displaced image data from previous or future refer- 
ence frames, no macrobiocks can be occluded or 
eliminated. Otherwise, the decoder may attempt to 
reference data for an image that had moved from a 
hidden macroblock to a visible macroblock. The de- 

10 coder would incur errors if the motion vector pointed 
to such a previously occluded macroblock. The work 
station 12 may in any event be configured, or any 
other suitable method may be used, to prevent a user 
from defining overlapping windows. Consequently, 

is this alternative embodiment supports MPEG2 motion 
compensation but restricts the receiving node user's 
windowing freedom. 

The multipoint encoder circuit illustrated in FIG. 
2 and the multipoint decoder illustrated in both FIGs. 

20 3 and 4 are suitable for use in nodes comprising a vid- 
eo teleconferencing system such as the one dis- 
cussed above in connection with FIG. 1. 

Referring again to FIG. 1, one draw back to the 
network configuration illustrated therein is that to re- 

25 ceive video from various sources, each user must es- 
tablish a virtual circuit connection over the network to 
the particular resolution of each other participant with 
whom he desires to establish a video conference. As 
a consequence, if any of the users A, B or C changes 

30 the window size or resolution to be viewed of one of 
the other participants, a new virtual circuit must be 
established on the network and the old one terminat- 
ed. Such ongoing connection and disconnection op- 
erations can be time consuming and otherwise incon- 

35 venient. To eliminate such problems, alternate config- 
urations may be implemented to provide multiple re- 
solution connections to remote users. A few of such 
alternative embodiments are discussed in connec- 
tion with FIGS. 5 and 6 below. 

40 FIG. 5 illustrates an alternative embodiment of a 
system 600 for providing video teleconferencing ser- 
vice wherein a network bridge 608 is provided to re- 
duce the connection and disconnection requirements 
of the communication equipment at end-points 610, 

45 612 and 614. The nodes 610, 612, and 614 may rep- 
resent the work station and display equipment of 
users A, B and C such as the work stations 1 2, 22 and 
32 illustrated in FIG. 1. The nodes 610, 612 and 614 
may operate both as transmission nodes and receiv- 

50 ing nodes. Alternatively, one or more of the users may 
be replaced by a video data storage device of some 
sort from which the remaining users access video 
data. The nodes 610, 612 and 614 are connected 
through the network 616 to a network bridge 608. 

55 The network bridge 608 comprises a dedicated 

piece of hardware that functions both as a network 
node and a network switch. The network bridge 608 
functions as a node because it establishes virtual cir- 



9 



17 



EP0 669 765 A2 



18 



cuit connections to the other nodes 61 0, 61 2 and 614. 
The network bridge functions as a switch because it 
establishes logical connections between the various 
nodes to which it is connected, as discussed below. 
The network bridge 608 may suitably comprise an 5 
asynchronous transfer mode (ATM) switching circuit, 
the implementation of which is well known to those of 
ordinary skill in the art. 

In operation, one or more participants in a video 
conference, for example, users A, B and C at nodes 10 
610, 612 and 614 establish virtual circuit connections 
to both transmit to and receive from the bridge. Each 
node provides its video signal to the bridge 608. If any 
of the nodes 610, 612 and 614 are capable of provid- 
ing multiple resolution levels, the multiple resolution 15 
video sequences are all sent to the network bridge 
608. The network bridge 608 then connects the 
users' transmissions to each node 610, 612 and 614 
as requested by each user. 

In this manner, each user need not establish a vir- 20 
tual circuit connection with each other participant 
over the network 616. Furthermore, if one of the 
users desires to change the display window size, and 
hence resolution, of one of the participants, the user 
does not need to disconnect one virtual circuit with 25 
the participant and establish another. Instead, the 
bridge merely connects the user to the video se- 
quence of the appropriate resolution level. By elimin- 
ating the connection/disconnection requirements of 
the nodes the hardware cost can be reduced and the 30 
implementation simplified. 

FIG. 6 illustrates an alternative embodiment of a 
bridge which may be used in place of the standard 
ATM switch discussed above in connection with FIG. 
6. The bridge 700 includes a multicast bus 708 which 35 
is capable of establishing a plurality of virtual circuits 
to several nodes connected over a network connec- 
tion 704. The multicast bus 708 comprises a data bus 
that can receive data from several sources. The mul- 
ticast bus 708 is coupled to a plurality of logical bridge 40 
circuits 712 1 . . . 71 2 n . Each logical bridge circuit 
712i . . .71 2 n is capable of establishing one virtual cir- 
cuit output connection to a receiving node, not shown. 
As a result, a particular bridge 700 having logical 
bridge circuits 71 2i . . .71 2 n can provide video signals 45 
to up to n nodes. 

The first logical bridge circuit 71 2-j is described 
below in detail. The remaining logical bridge circuits 
71 2 2 . . .71 2 n contain the same elements as the first 
circuit 71 2 1 and perform the same relative functions. so 
The first circuit 712i includes a depacketizer 716i 
connected to the multicast bus 708. The depacketizer 
71 6i comprises the circuitry required to convert net- 
work video data packets into streams of compressed 
video macroblocks. 55 

The depacketizer 71 6^ is connected to a macro- 
block T/E circuit 71 which may suitably comprise a 
circuit similar to the T/E circuit 41 0 from FIG. 3 above. 



The T/E circu it 71 8 , is connected to a packetizer 720 , . 
The packetizer 720 1 provides packets of video data to 
the virtual circuit connection to the logical bridge cir- 
cuit 712,. 

In operation, the bridge 700 operates as follows. 
Consider three node users A, B and C initiating and 
participating in a video teleconference located at 
nodes 610, 612 and 614 in FIG. 6. One or more of the 
users A, B and C provides a plurality of video se- 
quences corresponding to a plurality of resolution lev- 
els by establishing virtual circuit connections to the 
multicast bus 708. Each user A, B and C further es- 
tablishes a virtual circuit to one of the logical bridge 
circuits 712 1? . . .712 n to receive video data there- 
from. Thus, user A could be connected to 712 1f user 
B connected to 712 2 , and user C connected to 712 3 . 

When each user is connected, each user pro- 
vides one or more multiple resolution video sequenc- 
es to the multicast bus 708. Logical circuits 712 1( 71 2 2 
and 712 3 each receive video data for all the video se- 
quences provided by each participant. The depacke- 
tizer 716 1f 7162 and 716 3 provide all this video data 
to their respective T/E devices 718 1f 71 8 2 and 71 8 3 . 

The T/E devices 718 1f 7^ and 71 8 3 controllably 
eliminate macroblocks from the data stream in order 
to provide only the macroblocks that correspond to 
the particular user's window definition. For example, 
if user A selects a full screen of B's video overlapped 
by 1/16 screens of A's and C's video, The T/E device 
718! eliminates macroblocks from the video se- 
quences containing all other resolution levels of A's 
video, B's video, and C's video. Moreover, the T/E de- 
vice 71 8i also eliminates macroblocks form B's full 
screen video sequence which are hidden by A's win- 
dow and B's window. 

The T/E circuit 71 8^ also translates the macro- 
block's screen location to conform to user A's window 
placement. In fact, the T/E circuits 718 1t 7^ and 
71 83 operate in much the same manner as the T/E cir- 
cuit 4 1 0 in FIG. 4 above, the only difference being that 
the T/E circuits 718 1t 718 2 and 718 3 receive the video 
sequences containing all the resolution streams from 
each participant and therefore must eliminate more 
video data. In addition, each user must provide con- 
trol signals defining a desired window configuration 
to the user's assigned logical bridge. 

The bridge circuit 700 provides advantages over 
the simpler ATM switching circuit 608 of FIG. 5. Be- 
cause the macroblock elimination function is per- 
formed by the bridge, less data is required to be trans- 
mitted to the receiving nodes. This reduction in trans- 
mitted data reduces the bandwidth required. Further- 
more, because the T/E circuit function is performed 
at the bridge, receiving nodes need not possess such 
circuitry. Hardware costs are thereby reduced. 

Another alternative bridge consists of a bridge 
similar to the bridge 700 with the exception that it con- 
sists of only one logical bridge circuit 712 1( In such a 
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case, the logical bridge circuit 71 2^ establishes a vir- 
tual connection with all the transmitting nodes and all 
the receiving nodes. No multicast bus 708 is required. 
The logical bridge circuit 71 2y receives all the resolu- 
tion level signals from all of the transmission nodes, 
not shown, through the network connection 714. The 
logical bridge circuit 712) thereafter performs the 
same function as discussed above, including per- 
forming the macroblock elimination and translation 
steps. The logical bridge 71 2 1 then transmits the re- 
sulting video signal to all the receiving nodes. 

Because only one logical circuit is provided, only 
one window configuration will be available to all 
users. While this limits the flexibility of the windows 
environment, it reduces the hardware required by the 
network bridge 700. 

Although the foregoing examples illustrate video 
teleconferencing applications, the method of the 
present invention may readily be applied to other mul- 
tipoint video environments. The use of multipoint en- 
coding and decoding technology according to the 
present invention enables any video telecommunica- 
tions application to use a multiple windows environ- 
ment, including interactive multiparty video games. 
Furthermore, the multiple windows video environ- 
ment may be achieved relatively inexpensively be- 
cause it requires a minimum of additional encoding 
and decoding hardware. 

In alternative implementations, one or more of 
the work stations 1 2, 22 or 32 of FIG. 1 may comprise 
a central video data base, a video chalk board, or 
even a cable television service provider, depending 
on the implementation. In many cases, such work sta- 
tions need not always operated as a receiving node 
and may simply connect with consumer receiving 
nodes to provide video information thereto according 
to the methods of the present invention. 

It is to be understood that the use of a three node 
system is also given by way of example only. Any 
number of transmitting nodes, whether or not employ- 
ing the multipoint encoder circuit of the present inven- 
tion can provide video data to any number of receiving 
nodes employing the multipoint decoder circuit of the 
present invention described in connection with FIGs. 
3 and 4 above. Conversely, any number of receiving 
nodes, whether or not employing the multipoint de- 
coder circuit of the present invention, can receive vid- 
eo data from one or more transmitting nodes employ- 
ing the multipoint encoder circuit 200 of FIG. 2. 

Details such as the use of NTSC signals, a 640 x 
480 pixel digital display, and specific subsampling 
rates and methods are naturally given by way of ex- 
ample only. Other standard video signals and/or other 
digital display arrangements may readily be imple- 
mented in the present invention by one of ordinary 
skill in the art 



Claims 

1. A multipoint video communication system oper- 
able to support multiple window video environ- 

5 ment, the system comprising: 

a) a plurality of video transmission nodes, at 
least one of which includes a multipoint en- 
coder circuit, the multipoint encoder circuit 
comprising 

10 a source of digital video data; 

means for generating digital video data 
comprising a plurality of resolution levels from 
a video signal connected to the source of vid- 
eo data; and 

15 an encoding device; 

b) one or more video receiving nodes for re- 
ceiving video data comprising a plurality of 
video sequences from a plurality of video 
transmission nodes, each video sequence 

20 comprising macroblocks of video data and 

further including distinct context information, 
the receiving node comprising 

a decoder circuit operable to decode 
received macroblocks of video data; 

25 means for providing to the decoder cir- 

cuit context information corresponding to the 
video sequence of which each macroblock is 
part; 

a display controller connected to the 
30 decoder circuit for driving a video display de- 

vice; and 

c) means for controltably connecting the plur- 
ality of video transmission nodes to one or 
more video receiving nodes. 

35 

2. The multipoint video communication system of 
claim 1 wherein one or more of the video trans- 
mission nodes is also a video receiving node. 

40 3. The multipoint video communication system of 
claim 1 wherein the connecting means compris- 
es: 

a multicast bus for receiving video data 
from the plurality of transmission nodes; 

45 one or more logical bridge circuits con- 

nected to the multicast bus; and a communication 
network connecting the multicast bus to the plur- 
ality of transmission nodes, the network further 
connecting the one or more logical bridge circuits 

so to the one or more receiving nodes. 

4. A multipoint decoder circuit operable to receive 
compressed video data corresponding to a plur- 
ality of video sequences to be displayed simulta- 
55 neously in a windows format, each of the plurality 

of video sequences including context informa- 
tion, said context switchable decoder comprising: 
input means for receiving video data cor- 
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responding to a plurality of video sequences; 

a decoding means for decompressing the 
received video data; and 

means for providing video context informa- 
tion to the decoding means, the context informa- 5 
tion contained in the video sequence correspond- 
ing to the video data being decoded. 

5. The multipoint decoder circuit of claim 4 wherein 

the means for providing video context information 10 
comprises a random access memory device. 

6. A video display system operable to receive pack- 
ets of compressed video data, composed of 
frames, to be displayed in a windows environ- 15 
ment, said video display system comprising: 

a depacketizer operable to receive pack- 
ets of compressed video data corresponding to a 
plurality of video sequences and further operable 
to produce a stream of compressed video data 20 
corresponding to the plurality of video sequenc- 
es, each video sequence comprising a plurality of 
frames, each frame comprising a plurality of mac- 
roblocks, the macroblocks including a macro- 
block identifier representative of the macro- 25 
block's location within the a frame of a video se- 
quence; 

a macroblock eliminator connected to the 
depacketizer, said macroblock eliminator includ- 
ing means for eliminating video data correspond- 30 
ing to hidden portions of the plurality of video se- 
quences; 

a macroblock translation means connect- 
ed to the macroblock eliminator for translating the 
macroblock identifier into a new macroblock 35 
identifier representative of the macroblock's lo- 
cation within the display corresponding to a user- 
defined windows configuration; and 

a context switch able decoding means con- 
nected to the macroblock el iminator, said context 40 
switchable decoding including means for decom- 
pressing the video data. 

7. The video display system of claim 6 wherein the 
macroblock eliminator includes: 45 

a) a RAM-based look up table; and 

b) a video data buffer. 

8. The video display system of claim 6 wherein the 
context switchable decoder comprises: 50 

a) a decoding device; and 

b) a context random access memory coupled 
to the decoding device to transfer video con- 
text information there between. 

55 

9. A multipoint video encoder circuit operable to pro- 
duce compressed video data corresponding to a 
plurality of video sequences, each video se- 



quence representing a different resolution level 
of one original video sequence, the multipoint vid- 
eo encoder circuit comprising: 

a source of digital video data constituting 
the original video sequence; 

means for generating video data compris- 
ing a plurality of video sequences corresponding 
to a plurality of resolution levels from one original 
video sequence connected to the source of video 
data; 

an encoding device; and 
means for preparing packets of encoded 
video data for transmission. 

10. The multipoint video encoder circuit of claim 9 
wherein the means for generating video data 
comprising a plurality of video sequences com- 
prises: 

a first anti-aliasing filter connected to the 
source of video data; 

a first write controllable strip memory cou- 
pled to the first anti-aliasing filter, 

a second write controllable strip memory 
coupled to the source of video data. 

11. A method of receiving video data corresponding 
to a plurality of video sequences, each video se- 
quence comprising a plurality macroblocks and 
further comprising distinct context information, 
the video sequences to be displayed simultane- 
ously in a windows format, the method compris- 
ing: 

a) receiving video data corresponding to a 
plurality of video sequences; 

b) employing a decoding device to decode 
one of the plurality of macroblocks from one 
of the plurality of sequence within the re- 
ceived video data; 

c) providing the decoder with context informa- 
tion that corresponds to the video sequence of 
which the macroblock being decoded is part; 
and 

d) repeating steps b) and c) for a plurality of 
macroblocks; and 

e) displaying the plurality of decoded macro- 
blocks on a display device. 

12. The method of claim 11 further comprising the 
step of providing position information representa- 
tive of the position in which a macroblock will ap- 
pear on the display device relative to other mac- 
roblocks which make up the plurality of images in 
the received video data. 



12 



EP 0 669 765 A2 




13 



EP 0 669 765 A2 




EP0 669 765 A2 




15 



EP0 669 765 A2 




CNJ 



16 



EP 0 669 765 A2 




17 



EP 0 669 765 A2 



700 



\ 



A 



708 



70 



712 



718- 720- 



712 



, 2 716, 



71 8 2 720 2 



712, 



V 



n.716„ 71 8„ 720. 



FIG. 6 

18 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 



□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: : '. 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



BEST AVAILABLE IMAGES 




FADED TEXT OR DRAWING 



